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4 <110> APPLICANT: Matsui, Ikuo 

5 Ishikawa, Kazuhiko 

6 Ishida, Hiroyasu 

7 Kosugi, Yoshitsugu 

9 <120> TITLE OF INVENTION: THERMOPHILIC ENZYMES 
10 BETA-GLYCOSIDASE ACTIVITY 

12 <130> FILE REFERENCE: 11059/002001 

14 <140> ■ CURRENT APPLICATION NUMBER: 09/369, 7 3 5A 

15 <141> CURRENT FILING DATE: 1999-08-06 
17 <160> NUMBER OF SEQ ID NOS : 10 

19 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 1269 

23 <212> TYPE: DNA 

Pyrococcus horikoshii 



TEBE 



24 
26 



<213> ORGANISM 
<220> FEATURE: 

27 <221> NAME/KEY 

28 <222> LOCATION 
<400> SEQUENCE 



30 



CDS 

(1). . .(1269) 
1 

31 atg ccg ctg aaa ttc ccg gaa atg ttt etc ttt ggt 

32 Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly 

33 1 5 10 

35 tec eat cag ata gag gga aat aat aga tgg aat gat 

36 Ser His Gin He Glu Gly Asn Asn Arg Trp Asn Asp 

37 20 25 

39 gag cag att gga aag etc cce tac aga tct ggt aag 

40 Glu Gin He Gly Lys Leu Pro Tyr Arg Ser Gly Lys 
35 40 

tgg gaa ctt tac agg gat gat att cag eta atg acc 
Trp Glu Leu Tyr Arg Asp Asp He Gin Leu Met Thr 
50 55 60 

aat get tat agg ttc tec ata gag tgg age agg eta 

48 Asn Ala Tyr Arg Phe Ser He Glu Trp Ser Arg Leu 

49 65 70 75 

51 aat aaa ttt aat gaa gat get ttc atg aaa tac egg 

52 Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg 

53 85 90 

55 ttg tta ttg acg aga ggt ata act cce ctg gtg acc 

56 Leu Leu Leu Thr Arg Gly He Thr Pro Leu Vai Thr 



41 
43 
44 
45 
47 



acc gea aca 
Thr Ala Thr 
15 

tgg tgg tac 
Trp Trp Tyr 
30 

get tgc aat 
Ala Cys Asn 
45 

age ttg ggc 
Ser Leu Gly 

ttc eca gag 
Phe Pro Glu 



57 



100 



105 



59 act age ect etc tgg ttc atg aag aaa ggt ggc ttc 



60 
61 



Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe 
115 120 

63 aae eta aaa eat tgg gaa aag tac ata gaa aag gtt 

64 Asn Leu Lys His Trp Glu Lys Tyr He Glu Lys Val 
130 135 140 

gaa aaa gtt aaa eta gta get ace ttc aat gag ccg 



65 
67 



gag 
Glu 

eta 
Leu 

ctt 
Leu 
125 
get 
Ala 



att ata 
He He 
95 

eac eac 
His His 
110 

agg gag 
Arg Glu 

gag ctt 
Glu Leu 



tea 
Ser 

tat 
Tyr 

cac 
His 

tat 
Tyr 

gaa 
Glu 
80 
gac 
Asp 

ttt 
Phe 

gag 
Glu 

tta 
Leu 



atg gta tac gta 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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68 Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 

69 145 150 155 160 

71 atg atg gga tat eta acg get tat tgg ecc eea ttc att agg agt cea 528 

72 Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

73 165 170 175 

75 ttt aag gee ttt aag gta get gca aae ctg ett aaa get cae gca att 57 6 

76 Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

77 180 185 190 

79 gcc tat gaa ett ett eat ggg aaa tte aaa gtt gga ate gta aag aat 624 

80 Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly He Val Lys Asn 

81 195 200 205 

83 att ecc ata ata etc cca geg agt gac aag gag agg gat aga aaa gcc 672 

84 He Pro He He Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

85 210 215 220 

87 get gag aaa get gat aat tta ttt aac tgg eac ttt ttg gat geg ata 720 

88 Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala He 
'89 225 230 235 240 

91 tgg agt ggg aaa tac aga ggg gta ttt aaa aca tat agg att eec caa 768 

92 Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg He Pro Gin 

93 245 250 255 

95 agt gac gca gat tte att ggg gtt aac tat tac acg gee age gaa gta 816 

96 Ser Asp Ala Asp Phe He Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

97 260 265 270 

99 agg cat act tgg aat cct tta aaa ttc ttc ttt gag gtg aaa tta geg 864 

100 Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

101 275 280 285 

103 gat att age gag agg aag act caa atg gga tgg age gtt tat cca aaa 912 

104 Asp He Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

105 290 295 300 

107 gga ata tac atg gcc ett aaa aaa get tec agg tat gga agg cct ett 960 

108 Gly He Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 

109 305 310 315 320 

111 tat att acg gaa aae gga ata geg acg ett gat gat gaa tgg aga gtg 1008 

112 Tyr He Thr Glu Asn Gly He Ala Thr Leu Asp Asp Glu Trp Arg Val 

113 325 330 335 

115 gaa ttc ata att caa cae etc caa tac gtt cat aag get ate gaa gac 1056 

116 Glu Phe He He Gin His Leu Gin Tyr Val His Lys Ala He Glu Asp 

117 340 345 350 

119 ggc ctg gat gta aga ggt tac ttc tat tgg tea ttt atg gat aac tac 1104 

120 Gly Leu Asp Val Arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

121 355 360 365 

123 gag tgg aaa gag ggg ttt ggg cct aga ttt ggc eta gtg gaa gtt gat 1152 

124 Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

125 370 375 380 

127 tat caa ace ttc gag aga agg cee agg aag agt get tac gta tac gga 1200 

128 Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 

129 385 390 395 400 

131 gaa att gca aga agt aag gaa ata aag gat gag eta tta aag aga tat 1248 

132 Glu He Ala Arg Ser Lys Glu He Lys Asp Glu Leu Leu Lys Arg Tyr 
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133 405 

13 5 ggc eta cca gaa ctt caa ctt 

136 Gly Leu Pro Glu Leu Gin Leu 

137 420 



410 



415 



1269 



140 <210> SEQ ID NO: 2 

141 <211> LENGTH: 423 

142 <212> TYPE: PRT 

143 <213> ORGANISM: Pyrococcus horikoshii 

145 <4 00> SEQUENCE: 2 

146 Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly Thr Ala Thr Ser 

147 15 10 15 

148 Ser His Gin lie Glu Gly Asn Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

149 20 25 30 

150 Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

151 35 40 45 

152 Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

153 50 55 60 

154 Asn Ala Tyr Arg Phe Ser lie Glu Trp Ser Arg Leu Phe Pro Glu Glu 

155 65 70 75 80 

156 Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

157 85 90 95 

158 Leu Leu Leu Thr Arg Gly He Thr Pro Leu Val Thr Leu His His Phe 

159 100 105 110 

160 Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

161 115 120 125 

162 Asn Leu Lys His Trp Glu Lys Tyr He Glu Lys Val Ala Glu Leu Leu 

163 130 135 140 

164 Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 

165 145 - 150 155 160 

166 Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe He Arg Ser Pro 

167 165 170 175 

168 Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala He 

169 180 185 190 

170 Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly He Val Lys Asn- 

171 195 200 205 

172 He Pro He He Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

173 210 215 220 

174 Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala He 

175 225 230 235 240 

176 Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg He Pro Gin 

177 245 250 255 . 

178 Ser Asp Ala Asp Phe He Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

179 260 265 270 

180 Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

181 275 280 285 

182 Asp He Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

183 290 295 300 

184 Gly He Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 

185 305 310 315 320 
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186 


Tyr 


He 


Thr 


Glu 


Asn 


Gly 


He 


Ala 


Thr 


Leu Asp Asp Glu 


Trp 


Arg 


Val 


187 










325 










330 


335 




loo 


Glu 


Phe 


He 


He 


Gin 


His 


Leu 


Gin 


Tyr 


Val His Lys Ala 


He 


Glu 


Asp 


189 








340 










345 




350 




190, 


Gly 


Leu 


Asp 


Val 


Arg 


Gly 


Tyr 


Phe 


Tyr 


Trp Ser Phe Met 


Asp 


Asn 


Tyr 


191 






355 










360 




365 




192 


Glu 


Trp Lys 


Glu Gly 


Phe 


Gly 


Pro 


Arg 


Phe Gly Leu Val 


Glu 


Val 


Asp 


193 




370 










375 






380 






194 


Tyr 


Gin 


Thr 


Phe 


Glu 


Arg 


Arg 


Pro 


Arg 


Lys Ser Ala Tyr 


Val 


Tyr 


Gly 


195 


385 










390 








395 




400 


196 


Glu 


He 


Ala 


Arg 


Ser 


Lys 


Glu 


He 


Lys 


Asp Glu Leu Leu 


Lys 


Arg 


Tyr 


197 










405 










410 


415 


198 


Gly 


Leu 


Pro 


Glu 


Leu 


Gin 


Leu 















199 420 

201 <210> SEQ ID NO: 3 

202 <211> LENGTH: 57 

203 <212> TYPE: DNA 

204 <213> ORGANISM: Artificial Sequence 

206 <220> FEATURE: 

207 <223> OTHER INFORMATION: An upper primer designed to create the Ndel site 

209 <400> SEQUENCE: 3 

210 taagaaggag atatacatat gccgctgaaa ttcccggaaa tgtttctctt tggtacc 57 

212 <210> SEQ ID NO: 4 

213 <211> LENGTH: 46 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Artificial Sequence 

217 <220> FEATURE: 

218 <223> OTHER INFORMATION: A lower primer designed to create the BamHI site 

220 <400> SEQUENCE: 4 

221 tttactgcag agaggatccc taatcctaaa gttgaagttc tggtag 46 

223 <210> SEQ ID NO: 5 

224 <211> LENGTH: 423 



225 


<212> TYPE: PRT 






















226 


<213> ORGANISM: 


Pyrococcus 


horikoshii 












228 


<400> SEQUENCE: 


5 




















229 


Met Pro Leu Lys 


Phe 


Pro Glu 


Met 


Phe 


Leu 


Phe Gly 


Thr 


Ala 


Thr 


Ser 


230 


1 


5 








10 






15 




231 


Ser Lys Cys He 


Glu 


Gly Asn 


Asn 


Arg 


Trp 


Asn Cys 


Trp 


Trp 


Tyr 


Tyr 


232 


20 








25 








30 


233 


Glu Gin He Gly 


Lys 


Leu Pro 


Tyr 


Arg 


Ser 


Gly Lys 


Ala 


Cys 


Asn 


His 


234 


35 






40 






45 






235 


Trp Glu Leu Tyr 


Arg 


Asp Asp 


He 


Gin 


Leu 


Met Thr 


Ser 


Leu 


Gly 


Tyr 


236 


50 




55 








60 






237 


Asn Ala Tyr Arg 


Phe 


Ser He 


Glu 


Trp 


Ser 


Arg Leu 


Phe 


Pro 


Glu 


Glu 


238 


65 




70 








75 








80 


239 


Asn Lys Phe Met 


Glu 


Asp Ala 


Phe 


Met 


Lys 


Tyr Arg 


Glu 


He 


He 


Asp 


240 




85 








90 








95 


241 


Leu Leu Leu Thr 


Phe 


Gly He 


Thr 


Pro 


Leu 


Val Thr 


Leu 


His 


His 


Phe 


242 


100 








105 








110 
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243 


Thr 


Ser 


Pro 


Leu 


Trp 


Phe 


Met 


Lys 


Lys 


Gly Gly 


Phe 


Leu 


Arg 


Glu 


Glu 


244 






115 










120 










125 






24 5 


Asn 


Leu 


Lys 


His 


Trp Glu 


Lys 


Tyr 


He 


Glu 


Lys 


Val 


Ala 


Glu 


Leu 


Leu 


246 




130 










135 










140 










247 


Glu 


Lys 


Val 


Lys 


Leu 


Val 


Ala 


Thr 


Phe 


Asn 


Glu 


Pro 


Met 


Val 


Tyr 


Val 


248 


145 










150 










155 








160 


24 9 


Met Met Gly 


Tyr 


Leu 


Thr 


Ala 


Tyr 


Trp 


Pro 


Pro 


Phe 


He Arg 


Ser 


Pro 


250 










165 










170 










175 




251 


Phe 


Lys 


Ala 


Phe 


Lys 


Val 


Ala 


Ala 


Asn 


Leu 


Leu 


Lys 


Ala 


His 


Ala 


He 


252 








180 










185 










190 






253 


Ala 


Tyr 


Glu 


Leu 


Leu 


His 


Gly 


Lys 


Phe 


Lys 


Val 


Gly 


He 


Val 


Lys 


Asn 


254 






195 










200 










205 






255 


He 


Pro 


He 


He 


Leu 


Pro 


Ala 


Ser 


Asp 


Lys 


Glu 


Arg Asp 


Arg 


Lys 


Ala 


256 




210 










215 










220 






257 


Ala 


Glu 


Lys 


Ala 


Asp 


Asn 


Leu 


Phe 


Asn 


Trp 


His 


Phe 


Leu 


Asp Ala 


He 


258 


225 










230 










235 










240 


259 


Trp 


Ser 


Gly 


Lys 


Tyr Arg Gly 


Val 


Phe 


Lys 


Thr 


Tyr 


Arg 


He 


Pro 


Gin 


260 










245 










250 










255 




261 


Ser 


Asp 


Ala 


Asp 


Phe 


He Gly Met Asn 


Tyr 


Tyr 


Thr 


Ala 


Ser 


Glu 


Val 


262 








260 










265 










270 






263 


Arg 


His 


Thr 


Trp 


Asn 


Pro 


Leu 


Lys 


Phe 


Phe 


Phe 


Glu 


Val 


Lys 


Leu 


Ala 


264 






275 










280 










285 






265 


Asp 


He 


Ser 


Glu 


Arg 


Lys 


Thr 


Gin 


Met Gly 


Trp 


Ser 


Val 


Tyr 


Pro 


Lys 


266 




290 










295 










300 






267 


Gly 


He 


Tyr 


Met 


Ala 


Leu 


Lys 


Lys 


Ala 


Ser 


Pro 


Tyr Gly Arg 


Pro 


Leu 


268 


305 










310 










315 










320 


269 


Tyr 


He 


Thr 


Glu 


Asn 


Gly 


He 


Ala 


Thr 


Leu 


Asp 


Asp 


Glu 


Trp Arg Val 


270 










325 










330 










335 




271 


Glu 


Phe 


He 


He 


Gin 


His 


Leu 


Gin 


Tyr 


Val 


His 


Lys 


Ala 


He 


Glu 


Asp 


272 








340 










345 










350 




273 


Gly 


Leu 


Asp 


Val 


Arg 


Gly 


Tyr 


Phe 


Tyr 


Trp 


Ser 


Phe 


Met 


Asp Asn 


Tyr 


274 






355 










360 










365 






275 


Glu 


Trp 


Lys 


Glu 


Gly 


Phe 


Gly 


Pro 


Arg 


Phe Gly 


Leu 


Val 


Glu 


Val 


Asp 


276 




370 










375 










380 








277 


Tyr 


Gin 


Thr 


Phe 


Glu 


Arg 


Arg 


Pro 


Arg 


Lys 


Ser 


Ala 


Tyr 


Val 


Tyr 


Gly 


278 


385 










390 










395 








400 


279 


Glu 


He 


Ala 


Arg 


Ser 


Lys 


Glu 


He 


Lys 


Asp 


Glu 


Leu 


Leu 


Lys 


Arg 


Tyr 


280 










405 










410 










415 


281 


Gly 


Leu 


Pro 


Glu 


Leu 


Gin 


Leu 




















282 








420 



























284 <210> SEQ ID NO: 6 

285 <211> LENGTH: 483 

286 <212> TYPE: PRT 

287 <213> ORGANISM: Pyrococcus horikoshii 

289 <400> SEQUENCE: 6 

290 Met Lys Phe Tyr Trp Gly Val Val Gin Ser Ala Phe Gin Phe Glu Met 

291 1 5 10-15 

292 Gly Asp Pro Tyr Arg Arg Asn He Asp Pro Arg Ser Asp Trp Trp Tyr 

293 20 25 30 
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